Ewens' sampling formula and related formulae: combinatorial proofs, extensions to variable population size and applications to ages of alleles.

نویسندگان

  • Robert C Griffiths
  • Sabin Lessard
چکیده

Ewens' sampling formula, the probability distribution of a configuration of alleles in a sample of genes under the infinitely-many-alleles model of mutation, is proved by a direct combinatorial argument. The distribution is extended to a model where the population size may vary back in time. The distribution of age-ordered frequencies in the population is also derived in the model, extending the GEM distribution of age-ordered frequencies in a model with a constant-sized population. The genealogy of a rare allele is studied using a combinatorial approach. A connection is explored between the distribution of age-ordered frequencies and ladder indices and heights in a sequence of random variables. In a sample of n genes the connection is with ladder heights and indices in a sequence of draws from an urn containing balls labelled 1,2,...,n; and in the population the connection is with ladder heights and indices in a sequence of independent uniform random variables.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Convergence Time to the Ewens Sampling Formula

In this paper, we establish the cutoff phenomena for the discrete time infinite alleles Moran model. If M is the population size and μ is the mutation rate, we find a cutoff time of log(Mμ)/μ generations. The stationary distribution for this process in the case of sampling without replacement is the Ewens sampling formula. We show that the bound for the total variation distance from the generat...

متن کامل

Sampling formulae arising from random Dirichlet populations

Consider the random Dirichlet partition of the interval into n fragments at temperature θ > 0. Some statistical features of this random discrete distribution are recalled, together with explicit results on the law of its size-biased permutation. Using these, pre-asymptotic versions of the Ewens and Donnelly-Tavaré-Griffiths sampling formulae from finite Dirichlet partitions are computed exactly...

متن کامل

Large Deviations Associated with Poisson–dirichlet Distribution and Ewens Sampling Formula

Several results of large deviations are obtained for distributions that are associated with the Poisson–Dirichlet distribution and the Ewens sampling formula when the parameter θ approaches infinity. The motivation for these results comes from a desire of understanding the exact meaning of θ going to infinity. In terms of the law of large numbers and the central limit theorem, the limiting proc...

متن کامل

An Asymptotic Sampling Formula for the Coalescent with Recombination.

Ewens sampling formula (ESF) is a one-parameter family of probability distributions with a number of intriguing combinatorial connections. This elegant closed-form formula first arose in biology as the stationary probability distribution of a sample configuration at one locus under the infinite-alleles model of mutation. Since its discovery in the early 1970s, the ESF has been used in various b...

متن کامل

An exact sampling formula for the Wright-Fisher model and a solution to a conjecture about the finite-island model.

An exact sampling formula for a Wright-Fisher population of fixed size N under the infinitely many neutral alleles model is deduced. This extends the Ewens formula for the configuration of a random sample to the case where the sample is drawn from a population of small size, that is, without the usual large-N and small-mutation-rate assumption. The formula is used to prove a conjecture ascertai...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:
  • Theoretical population biology

دوره 68 3  شماره 

صفحات  -

تاریخ انتشار 2005